Event Detection in Twitter using Aggressive Filtering and Hierarchical Tweet Clustering

Authors

  • Georgiana Ifrim
  • Bichen Shi
  • Igor Brigadir
Abstract

Twitter has become as much a news medium as a social network, and much research has turned to analyzing its content for tracking real-world events, from politics to sports and natural disasters. This paper describes the techniques we employed for the SNOW Data Challenge 2014 [Pap14]. We show that aggressive filtering of tweets based on length and structure, combined with hierarchical clustering of tweets and ranking of the resulting clusters, achieves encouraging results. We present empirical results and discussion for two different Twitter streams, one focusing on the US presidential elections in 2012 and the other on the events concerning Ukraine, Syria and Bitcoin in February 2014.
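
The three stages named in the abstract (filtering, hierarchical clustering, cluster ranking) can be illustrated with a minimal sketch. Everything specific in it is an assumption made for illustration, not taken from the paper: the thresholds `min_words`, `max_tags` and `max_mentions`, the Jaccard distance over token sets, average linkage with a `max_distance` cut-off, and ranking by cluster size. The authors' actual features, distance measure and ranking criteria may differ.

```python
import re
from itertools import combinations

TOKEN_RE = re.compile(r"[#@]?\w+")


def tokenize(text):
    """Lowercase a tweet, drop URLs, and split into word/hashtag/mention tokens."""
    text = re.sub(r"https?://\S+", " ", text.lower())
    return TOKEN_RE.findall(text)


def keep_tweet(text, min_words=4, max_tags=2, max_mentions=2):
    """Length/structure filter. All thresholds are hypothetical."""
    tokens = tokenize(text)
    words = [t for t in tokens if t[0] not in "#@"]
    tags = [t for t in tokens if t.startswith("#")]
    mentions = [t for t in tokens if t.startswith("@")]
    return (len(words) >= min_words
            and len(tags) <= max_tags
            and len(mentions) <= max_mentions)


def jaccard_distance(a, b):
    """1 - |a & b| / |a | b| for two token sets (assumed distance measure)."""
    union = a | b
    return 1.0 - len(a & b) / len(union) if union else 0.0


def cluster_tweets(tweets, max_distance=0.6):
    """Average-linkage agglomerative clustering, stopped at max_distance.

    Returns clusters as lists of indices into `tweets`.
    """
    bags = [set(tokenize(t)) for t in tweets]
    clusters = [[i] for i in range(len(tweets))]

    def linkage(c1, c2):
        total = sum(jaccard_distance(bags[i], bags[j]) for i in c1 for j in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > 1:
        # Find the closest pair of clusters and merge it, unless it is too far.
        (a, b), dist = min(
            (((x, y), linkage(clusters[x], clusters[y]))
             for x, y in combinations(range(len(clusters)), 2)),
            key=lambda item: item[1],
        )
        if dist > max_distance:
            break
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return clusters


def rank_clusters(clusters):
    """Rank clusters by size, largest first (a stand-in ranking criterion)."""
    return sorted(clusters, key=len, reverse=True)
```

A typical call would be `kept = [t for t in stream if keep_tweet(t)]` followed by `rank_clusters(cluster_tweets(kept))`, where `stream` is any list of tweet texts. The pairwise loop is quadratic per merge and only suits small batches; a real stream would need a library implementation or windowing.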

Similar resources

Recency is good: expanding with fresh news improves event detection in Twitter

Twitter is a popular microblogging site that is a good source of real-time information. Detecting events in Twitter is an ongoing research effort and a fundamental task is clustering tweets according to which (news) event they describe. Document expansion can improve this clustering, especially for Twitter, given that tweets are short. While document expansion using external corpora has been ar...

A Probabilistic Model of Real Time Event Detection and Reporting

A probabilistic model provides a way to detect multiple instances of real-time events and to estimate the location of targeted events such as earthquakes, typhoons and traffic jams. For this, two models have been proposed, named the temporal and spatial models, to detect real-time events and estimate the targeted event locations respectively by dealing with sensor readings appropriately. Our work is based o...

Earthquake Reporting System by Using Real Time Nature of Twitter

An important characteristic of Twitter, a popular microblogging service, is its real-time nature. We analyze the real-time interaction of events such as earthquakes in Twitter and propose an algorithm to monitor tweets and to detect a target event. To detect a target event, we devise a classifier of tweets based on features such as the keywords in a tweet, the number of words, and their...

Cluster-discovery of Twitter messages for event detection and trending

Social media data carries abundant hidden occurrences of real-time events in the world, which raises the demand for efficient event detection and trending systems. The Locality Sensitive Hashing (LSH) technique is capable of processing large-scale datasets. In this thesis, a novel framework is proposed for detecting and trending events from tweet clusters present in the Twitter dataset tha...

TwitterNews: Real time event detection from the Twitter data stream

Research in event detection from the Twitter streaming data has been gaining momentum in the last couple of years. Although such data is noisy and often contains misleading information, Twitter can be a rich source of information if harnessed properly. In this paper, we propose a scalable event detection system, TwitterNews, to detect and track newsworthy events in real time from Twitter. Twitt...

Publication date: 2014